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Abstract 

We prove that the generalized Thue-Morse word tj m defined for b > 2 and 
m > 1 as tb i?n = (sf,(n) mod m)^^, where Sb(n) denotes the sum of digits in the 
base-6 representation of the integer n, has its language closed under all elements of 
a group D m isomorphic to the dihedral group of order 2m consisting of morphisms 
and antimorphisms. Considering simultaneously antimorphisms G E D m , we show 
that t^m is saturated by 0-palindromes up to the highest possible level. Using the 
terminology generalizing the notion of palindromic richness for more antimorphisms 
recently introduced by the author and E. Pelantova, we show that tb jTn is L> m -rich. 
We also calculate the factor complexity of tb, m - 

1 Introduction 

Palindrome is a word which coincides with its reversal image, or more formally, w = R{w), 
where w is a finite word and the reversal (or mirror) mapping R : A* > A* is defined 
by R(wiiu 2 ■ ■ ■ w n ) = w n w n _i...Wi for letters Wi G A, A being an alphabet. In j8], 
the authors gave an upper bound on the number of palindromic factors in a finite word: 
a finite word of length n contains at most n + 1 palindromic factors. If this bound is 
attained, we say that a word is rich in palindromes (see j9], or jl], where such a word is 
called full). This definition can be naturally extended to infinite words: an infinite word 
is rich in palindromes if every its factor is rich in palindromes. For infinite words with 
language closed under reversal, i.e., containing with every factor also its reversal image, 
there exist several equivalent characterizations of palindromic richness. Each of them can 
be adopted as definition. 

Let us list three of these characterizations. An infinite word u with language closed 
under reversal is rich if one of the following statements holds: 

1. any factor w of u of length n contains exactly n + 1 palindromic factors; 

2. for any n G N, the equality AC(n) + 2 = V(n) + V(n + 1) is satisfied, where 
AC(n) denotes the first difference of the factor complexity of u and V(n) denotes 
the palindromic complexity of u ([5]). 
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3. each complete return word of any palindrome occurring in u is a palindrome as well 

(0); 



Let us mention that the inequality 



AC(ra) + 2 > V{n) + V{n + 1) 



(1) 



is valid for any infinite word u with language closed under reversal and for any n (see [2]). 
Thus, both characterizations[T]and[2]express that a word u rich in palindromes is saturated 
by palindromes up to the highest possible level. Arnoux-Rauzy words (among them 
Sturmian words) and words coding interval exchange transformation with the symmetric 
permutation belong to the most prominent examples of rich words. 

If we replace the reversal mapping R by an antimorphism 0, we can define 0-palindromes 
as words which are fixed points of 0, i.e., w = Q(w). For any antimorphism 0, the notion 
of ©-palindromic richness can be introduced and an analogy of characterizations 1, 2 and 
3 mentioned above can be formulated, see |12) . 

Although the language of the Thue-Morse word is closed under two antimorphisms, it 
is not 0-rich for any of these two antimorphisms. The author together with E. Pelantova 
in [10] explored infinite words with language closed under more antimorphisms simulta- 
neously. For a given finite group G formed by morphisms and antimorphisms on .4*, the 
words with language closed under any element of G are investigated. If, moreover, such a 
word u is uniformly recurrent, then a generalized version of the inequality ([T|) is proved: 
there exists an integer N such that 



where G^ 2 ' denotes the set of involutive antimorphisms of G and Ve is the ©-palindromic 
complexity, i.e., Ve(n) counts the number of ©-palindromes of length n in the language 
of u. 

Infinite uniformly recurrent words with language closed under all elements of G and 
for which the equality in ([2]) is attained for any n greater than some integer M are called 
almost G-rich. Let us emphasize that if the group G contains besides the identity only the 
reversal mapping R, the notion of almost richness (as introduced in [§]) and the notion 
of almost G-richness coincide. In [TU], G -richness is also introduced. Again, in case 
of G = {R, Id} it coincides with classical palindromic richness. The definition requires 
further notions, thus we omit it here and restrict ourselves to the following criterion of 
G-richness. 

Proposition 1. An infinite word with language closed under all elements of G is G-rich 



• for any two antimorphisms @i,©2 G G and any non-empty v G C(u) it holds 




for any n > N 



(2) 



eeG( 2 ) 



if 



e x ^ @ 2 0i (v) ^ e 2 (v), and 
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• for any two morphisms ifi,(p2 G G and any non-empty v G £(u) it 
^1 ^2 ^ 7^ V2(f ), and 

• the equality (j2j) is attained for all n > 1 . 

In [10], the authors show that the Thue-Morse word is G-rich, where G is a group 
generated by the reversal mapping and the antimorphism exchanging and 1. The class 
of so-called generalized Thue-Morse words is also partially treated in the article, but 
the question of their G-richness is not resolved. In this article we give a proof that all 
generalized Thue-Morse words are G-rich and we give explicitly the group G. 

The generalized Thue-Morse sequences were already considered by E. Prouhet in 1851, 
see j 1 1 j . Let s&(n) denote the sum of digits in the base-6 representation of the integer n, 
for integers b > 2 and m > 1. The generalized Thue-Morse word t& i?n is defined as 

t&, m = (> 6 (n) mod m)+= ■ 

Using this notation, the famous Thue-Morse word equals t2,2- The word tfe m is over the 
alphabet {0, 1, . . . , m — 1} = Z m . Similarly to the classical Thue-Morse word, also t& jTn 
is a fixed point of a primitive substitution, as already mentioned in pQ. It is easy to see 
that the substitution fixing the word t& iTn is defined by 

<Pb,m(k) = k(k + l)(k + 2) . . . (k + (b - 1)) for any k G Z m , 

where the letters are expressed modulo m. As already stated in [JJ, it can be shown that 
t& )}n is periodic if and only if b = 1 mod m. 

We show that for any parameters 6 and m the language of the word t& )m is closed 
under all elements of a group, denoted D m , isomorphic to the dihedral group of order 2m, 
see Proposition Q, and that t& ;m is D m -rich, see Theorem @. In the last section, we use 
the results to give the formula for the factor complexity of t& >m . 

2 Preliminaries 

An alphabet A is a finite set, its elements are called letters. A finite word over A is a 
finite string w = W1W2 ■ ■ ■ w n of letters iOj G ^4. Its length is \w\ = n. The set A* is formed 
by all finite words and it is a free monoid with the empty word e as neutral element. An 
infinite word u = (tti)^o is a sequence of letters Uj G A A word i> G ^4* is a factor of a 
word w; (finite or infinite) if there exist words s, t G A* such that w = svt. If s = e, then i> 
is a prefix of u;, if t — e, then f is a suffix of w. An integer i such that v = Wi . . . w i+ \ w \x 
is called an occurrence of v in iw. We say that an infinite word is uniformly recurrent if 
for each factor the gaps between its successive occurrences are bounded. 

By £ n (u) we denote the set of factors of u of length n. The set of all factors of u is 
denoted by £(u) and is called the language of u. The factor complexity C of an infinite 
word u is a mapping N 4 N counting the number of distinct factors of a given length, 
i.e., C{n) = #£ n (u). 
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A letter a G A is a left extension of a factor u; G £(u) if aw G £(u). If a factor has 
at least two distinct left extensions, then it is said to be left special. The set of all left 
extensions of w is denoted Lext(w). The definition of right extensions, Rext(w) and right 
special is analogous. A factor which is left and right special is called bispecial (BS). 

The bilateral order b(w) of a factor w is the number b(w) := #Bext(to) — #Lext(w) — 
#Rext(u>) + l where the quantity Bext(u') = {awb G £(u) | a, b G A}. In [6], the following 
relation between the second difference of factor complexity and bilateral orders is shown: 

C(n + 2) -2C(n + l) +C(n) = A 2 C(n) = b(w). (3) 

One can easily show that if w is not a bispecial factor, then b(w) = 0. Thus, to enumerate 
the factor complexity of an infinite word u one needs to calculate C(0), C(l), and bilateral 
orders of all its bispecial factors. 

A mapping (p on A* is called a morphism if (p{vw) = (p(v)ip(w) for any v,w E A*; an 
antimorphism if = (p(w)tp(v) for any t>,u> G .A*. By v4M(A*) we denote the set of 

all morphisms and antimorphisms over A*. Let v G AM(A*). We say that £( u ) * s closed 
under v if for all «; G £(u) we have ^(ty) G £(u). 

It is clear that the reversal mapping R is an antimorphism. Moreover, it is an involu- 
tion, i.e., R 2 = Id. A fixed point of an antimorphism G is called -palindrome. If = R, 
then we say palindrome or classical palindrome instead of i?-palindrome. The set of all 
©-palindromic factors of an infinite word u is denoted by Pale(u). The -palindromic 
complexity of u is the mapping P e : N i— > N given by Ve(n) = # (Pale(u) PI £ n (u)). If 
a G A, w G Pale(u), and awQ(a) G C(u), then awQ(a) is said to be a -palindromic 
extension of w in u. The set of all ©-palindromic extensions of w is denoted by Pexte(w)- 

3 Generalized Thue-Morse words and dihedral groups 

In this section we show that £(t& jm ) is closed under all elements of an explicit group 
G C AM (A*). Fix b > 2 and m > 1. In what follows, to ease the notation, we denote 
V? = fb,m, the alphabet is considered to be Z m , and letters are expressed modulo m. 
For all x G Z m denote by ^ the antimorphism given by 

^x(^) := x — k for all & G Z m 

and by n x the morphism given by 

H x (k) := x + k for all k G Z m . 

Denote by D m the set D m = {^ x | x G Z m } U {II X | x G Z m }. It is easy to show that 
D m is a group and can be generated by 2 elements, for instance one can choose II! and 
\EV Since the order of Hi is m, ^>q is an involution, and \J/ TIi is also an involution, D m 
is isomorphic to the dihedral group of order 2m. 

The following property of (p will help us to prove the next proposition. 
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Property I For all x G Z m , we have Ti x (p = <pH x and ty x ip = ip^f x+ i,-i- 

Proof. It follows directly from the definitions of </?, U x and □ 

Proposition 2. 27ie language oft^ m is closed under all elements of D m . 

Proof. We show the claim by induction on the length n of factors. For n = 1 the statement 
is easy to verify since Ci{t^ m ) = Z m . Suppose now the claim holds for factors of length n 
and take w G £n+i(t&,m)- It is clear that there exists a factor v, 1 < |i>| < n, such that u> is 
a factor of <£>(i>)- Let x G Z m , then using Property [J one has H x ip(v) = ipH x (v). Since we 
supposed Tl x {v) G £(t& jn j), it is clear that H x (w) is a factor of t& ifn Using again Property 
H]for ^/j;, we have also ty x tp(v) = ipty x+ b-i (v) , and thus ^ x (w) is a factor of t& ;m . □ 



4 Z^-richness of t& jm 

In this section we show that the word t& iTn is D m -rich. Let 7r : Z m i— > Z m denote the 
permutation defined for all k G Z m by 

7r(fc) = the last letter of <p{k) = k + b — 1 = 

Denote by q the order of 7r, i.e., the smallest positive integer such that q(b — 1) = 
mod m. We say that a factor t> = vqVi . . . v s _i G £(t& >rri ) is an ancestor of a factor 
w; G £(t& m ) if to is a factor of <^(v) and is not a factor of tp(vi . . . or <p(vo . . . v s _ 2 ). 
The following properties of tp and t^m can be easily deduced. 

Property II <p is uniform, i.e., for all k,£ G Z m , = |y?(^)| = 

Property III tp is bifix-free, i.e., for all k,£ G Z m , k ^ £, the first letter of 9?(/c) differs 
from the first letter of <p(£), and the same holds for the last letter. 

Property IV £ 2 (t&, m ) = {ir k (r — l)r \ r G Z m , < jfc < q - 1}. 

Proof. Denote by L := {(r — l)r | r G Z m }. It is clear that L C ^{^m)- 
For all z, denote by L i+1 the set of factors of length 2 of the words (p(w), 
w G Li. From the definition of n, it is clear that Lj = {7r fc (r — l)r | r G 
Z m ,0 < k < i}. The definition of g then guarantees L 9 _i = L g and the 
equality L q -\ = £2(t&,m) follows from the construction of the sets Lj. □ 

Property V £ 3 (t&,m) = 0^0 - + 1) I t G Z m , < k < q - 1} U {(t - l)tn~ k {t + 1) | 
t G Z m ,0 < k < q- 1}. 

Proof. It follows from Property IIVI and the definition of □ 

Property VI For all words w of length 1 or 2, there exists exactly one x such that w is 
a ^-palindrome. 
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Proof. It follows directly from the definition of ty x . □ 

Property VII If for x G Z m , the word w G £(t& im ) is a ^-palindrome, then the factor 
<p(w) is a \l/ x _b + i-palindrome. 

Proof. It follows directly from Property [H □ 

Property VIII Let w G £(t& jm ) be a BS factor and z/ G D m . Then u(w) is BS factor and 
b(w) = b(u(w)). 

Moreover, if it; is a Q-palindrome for some antimorphism G D m , then 
v(w) is a ©'-palindrome for some 0' G D m . 

Proof. Property IIIII and Proposition [2] guarantee the first part of the state- 
ment. The second part of the statement can be verified by setting 0' = 
uQu- 1 . □ 

Property IX If w = Wq . . . if s _i G C(tb tTn ) and there is an index i such that Wj+i 7^ tfj + l, 
then 10 has exactly one ancestor. 

Proof. It follows directly from the definition of </? and Property [Til Q 

Property X Let b ^ 1 mod m and to G £(tft ;m ). If |w| > 26, then w has exactly one 
ancestor. 

Proof. Take [it? j = 26+1. Suppose that there is no index i such that 
Wi+i 7^ Wi + 1, i.e., w = + 1) . . . (A; + 26) for some integer fc. Since 
every ancestor of w is of length 3, it implies that there exists a factor 
v G Csit^m) such that v = £(£ + b)(£ + 2b) for some i. Since 6^1 mod m, 
it is a contradiction with Property El D 

Property XI If w G £(tb, m ) is BS and \w\ > 6, then there exist letters x and y such that 
is a prefix of w and <p(y) is a suffix of w. 

Proof. The claim is a direct consequence of Property IIIII and the definition 
of tp. □ 

These properties are used to prove the next two lemmas. The first lemma summarizes 
the bilateral orders and ©-palindromic extensions of longer BS factors. 

Lemma 3. Let 6^1 mod m. Let w G £(t{, m ) be a BS factor such that \w\ > 26. Then 
there exists a BS factor v such that <p{v) = w. Furthermore, b(w) = b(v). 

If v is a Qi-palindrome, ©1 G D m , then there exists a unique ©2 G D m such that w is 
a ®2-palindrome. Moreover, #Pexte 2 (if) = #Pexte 1 (u). 
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Proof. Let wbea BS factor of length > 2b. 

If > 2b, the existence of a unique ancestor v follows from Property IXl If = 2b 
and if there exists an ancestor v G ^(t^), we have a contradiction with Properties IXII 
and [V] Thus, if \w\ = 2b, then there exists a unique ancestor v G £2(t& TO ) such that 
(p(v) = w. 

Property II III guarantees v is BS and b(w) = b(v). 

Suppose v is a \I/ x .-palindrome for some x G Z m . According to Property IVIH w is 
a ^/^.fe+i-palindrome. The fact that there is no other such antimorphism follows from 
Property IVIl 

The equality #Pextq, x (v) = =f£Pext\& x _ b+1 (w) follows again from Property IVIII □ 

Thanks to the last lemma, we have to evaluate only the bilateral orders and number 
of palindromic extensions of shorter factors. The next lemma exhibits these values for 
concerned lengths of BS factors. 

Lemma 4. Let 6^1 mod m. Let w be a BS factor of t& jm such that 1 < \w\ < 2b. 
Let G D m be the unique antimorphism such that w = Q(w). Then the values b(w) and 
#Pexte(w) are as shown in Table\j\. 





w 




b(w) 


#Pext e H 


1 < 


u 


< 6-1 





1 




w 




= b 


1 


2 


b+ 1 < | 


w\ 


<2b-2 





1 


\w 


= 2b-l 


-1 






Table 1: The bilateral order and number of palindromic extensions of a BS factor w of 
tb j7n , b ^ 1 mod m, according to its length \w\. G D m is the antimorphism such that 
Q(w) = w. 

Proof. Let w, < \w\ < 2b, be a BS factor and let s denote its length. It follows from 
Property |XT] that there exists k G Z m such that 

w = k(k + 1) . . . (k + s - 1). 

Since w = IL; (01 . . . (s — 1)), thanks to Property IVIIII we may take w — 01 . . . (s — 1). 

Let G D m be the unique antimorphism such that w = Q(w). From the form of w it 
follows that = \& s _!. 

We discuss the following cases distinguished by s. 

a) s = 2b - I. 

One can see that w has exactly 2 ancestors: the words Ob and (m— l)(b— 1). (Both Ob 
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and (to- 1)(6- 1) belong to £ 2 (t 6 , m ) since tt 9-1 ^- 1)6 = 06 and ^(6 - 2)(6 - 1) = 
(m — 1)(6 — 1).) The only pairs of letters x and y such that xwy G £(tf> >m ) are 
m — 1 + 6 — 1 and 26—1, and to — 1 and 6. Therefore, b(w) = 2 — 2 — 2 + 1 = —1. Since 
© = ^26-2; on e can see that no extension xwy is a O-palindrome, i.e., #Pexte(u>) = 0. 

b) 6 + 1 < s < 26 - 2. 

One can deduce that has 26 — s + 1 ancestors, namely (s — 2b + i)(s — b + i) for 
< i < 26 — s. (Again, all these words are factors of t& m since 7r g_1 (2 — 1) = i — b for all 
z.) The only extensions xwy appearing in C{t bm ) are (m — l)u>(s — 6+1), (m — l)u>s, 
and (m + 6 — 2)u;s. Thus, we have b(w) = 3 — 2 — 2 + 1 = and #Pexte(u>) = 1 
((to — l)u;s is a 0-palindrome) . 

c) s = b. 

The ancestors of w are the factors and (i — 6)i, for < i < b. The extensions 
xwy are (m — l)u>6, 7r £ ( — l)tyl , and (m + 6 — 2)u>7r~^(l), where < £ < q. Therefore, 
b(w) = 2q—q—q+l = 1. Since the only G-palindromes are (to— 1)u>6 and (m+6— 2)u>l, 
we have #Pexte(w) = 2. 

d) 2 < s < b- 1. 

The ancestors are the factors to — 1, . . . ,to + s — 6, the factor 0, and the factors (i — b)i, 
for < i < s. The extensions xwy are ir e (—l)ws and (— l)^7r~ f (s), where < £ < q. 
Thus, b(w) = (2q — 1) — q — g + 1 = and since (m — l)ws is the only 0-palindromic 
extension, we have #Pexte(w^) = 1- 

e) s = 1. 

One can see that the extensions are 7i^(— l)u>l and (— l)w7T (1), where < £ < g. 
Thus, b(u>) = and #Pexte(w) = 1 as in the previous case. 

□ 

Corollary 5. Let w be a non-empty BS factor oftb. m and 6^1 mod m. Then 

1. there exists a unique antimorphism O G D m such that Q(w) = w; 

2. b(w) = #Pext©H - 1. 
Theorem 6. The word tf, jm is D m -rich. 
Proof. First, let 6 ^ 1 mod to. We show that 

AC(n) + 2m = (v B {n) +V e (n + 1)) for all n > 1. (4) 

eeD m 

antimorphism 

Note that #D m = 2m. 

First, we show the relation (jlj) for n — 1. It is clear that C(l) = m and C(2) = qm. 
Thus, the left-hand side equals qm — m + 2m = qm + m. According to Properties [TV] and 
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EH it is clear that ^ V@{1) = m and ^ V e (2) = qm. Therefore, the 

eeD m eeD m 

antimorphism antimorphism 

right-hand side equals qm + m. 

To show the relation 01]), we are going to verify for all n > 1 that the difference of the 
left-hand sides for indices n + 1 and n equals the difference of the right-hand sides for the 
same indices. In other words, we are going to show that 

AC(n+l)-AC(n) = ^ {v e {n + 2) - V@{n)\ (5) 

©S-Drn 

antimorphism 

for all n. 

According to the equation ([3]), the left-hand side side can be written as 

A 2 C(n) = Yl b H 

wec n {t bi7n ) 

w BS 

and for the right-hand side we can use 

Pe(n + 2)-V e (n)= £ (#Pext e M - 1). 

«i6£„(ti, im ) 
«;=©(ui) 

Using the fact that a non-bispecial O-palindrome has exactly one 0-palindromic extension 
and Corollary 0, the relation (JSJ) holds. 

If tb /m is periodic, i.e., 6=1 mod m, the proof of the relation (E]) can be done in a 
very similar way and is left to the reader. 

Finally, according to Proposition [l] and Property I VI] t& >m is D m -rich. 

□ 

5 Factor complexity 

To our knowledge, the factor complexity of the Thue-Morse sequence t 2 2 was described 
in 1989 independently in [3] and [7] and of t 2 , m in [T3] . 

In the aperiodic case, to calculate the factor complexity, one can use ([3]) and Lemmas M 
and 0. Table Q shows the result: AC(n) and C(n). In the periodic case, the factor 
complexity is trivial: C(n) = m for all n > 0. 
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n 


AC(n) 


C(n) 





m — 1 


1 


1 


qm — m 


m 


2 < n < b 


qm — m 


qm(n — 1) — m(n — 2) 


b k + 1 + £ 
k> 1, < £ <b k - b k ~ x 


qm 


qm(n — 1) — m(b k — b k ^ 1 ) 


(2b- l)b k - L + l + £ 
k>l,0<£< b k+1 - 2b k + b k - x 


qm — m 


qm(n — 1) — m(6 fe — b^ 1 + 



Table 2: Values of AC(n) and C(n) of the generalized Thue-Morse word t^ m for the 
aperiodic case 6^1 mod m. 
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